SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Quertermous, Thomas
Hogan, Brigid
Snodgrass, H. Ralph
Zupancic, Thomas J.

(ii) TITLE OF INVENTION: DEVELOPMENTALLY-REGULATED ENDOTHELIAL CELL LOCUS-1

(iii) NUMBER OF SEQUENCES: 29

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Pennie & Edmonds
(B) STREET: 1155 Avenue of the Americas
(C) CITY: New York
(D) STATE: New York
(E) COUNTRY: United States
(F) ZIP: 10035-2711

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: To Be Assigned
(B) FILING DATE: 05-JUN-199S
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Poissant, Brian M.
(B) REGISTRATION NUMBER: 23,452
(C) REFERENCE/DOCKET NUMBER: 8907-034

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 790-9090
(B) TELEFAX: (212) 859-8854/9741
(C) TELEX: 65141 Pennie



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Asp Leu Leu Val Pro Thr Lys Val Thr Gly Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Ala Lys Asp Phe Gly Asp Val Leu Phe Val Gly Ser Tyr Lys Leu
20 25 30

Ala Tyr Ser Asn Asp Gly Glu His Trp Met Val His Gln Asp Glu Lys
35 40 45

Gln Arg Lys Asp Lys Val Phe Gln Gly Asn Phe Asp Asn Asp Thr His
50 55 60

Arg Lys Asn Val Ile Asp Pro Pro Ile Tyr Ala Arg Phe Ile Arg Ile
65 70 75 80

Leu Pro Leu Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Asp Leu Gly Ser Ser Lys Glu Val Thr Gly Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Ala Arg Asn Phe Gly Ser Val Gln Phe Val Ala Ser Tyr Lys Val
20 25 30

Ala Tyr Ser Asn Asp Ser Ala Asn Trp Thr Glu Tyr Gln Asp Pro Arg
35 40 45

Thr Gly Ser Ser Lys Val Phe Gln Gly Asn Leu Asp Asn Asn Ser His
50 55 60

Lys Lys Asn Ile Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arg Val
65 70 75 80

Leu Pro Val Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Asp Leu Leu Lys Ile Lys Lys Ile Thr Ala Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Cys Lys Ser Leu Ser Ser Glu Met Tyr Val Lys Ser Tyr Thr Ile
20 25 30

His Tyr Ser Glu Gln Gly Val Glu Trp Lys Pro Tyr Arg Leu Lys Ser
35 40 45

Ser Met Val Asp Lys Ile Phe Glu Gly Asn Thr Asn Thr Lys Gly His
50 55 60

Val Lys Asn Phe Phe Asn Pro Pro Ile Ile Ser Arg Phe Ile Arg Val
65 70 75 80

Ile Pro Lys Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Asp Leu Gln Lys Thr Met Lys Val Thr Gly Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Val Lys Ser Leu Phe Thr Ser Met Phe Val Lys Glu Phe Leu Ile
20 25 30

Ser Ser Ser Gln Asp Gly His His Trp Thr Xaa Xaa Gln Ile Leu Tyr
35 40 45

Asn Gly Lys Val Lys Val Phe Gln Gly Asn Gln Asp Ser Ser Thr Pro
50 55 60

Met Met Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Xaa Xaa Xaa Xaa
65 70 75 80

Xaa Xaa Xaa Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Asp Leu Glu Asn Leu Arg Phe Val Ser Gly Ile Gly Thr Gln Gly Ala
1 5 10 15

Ile Ser Lys Glu Thr Lys Lys Lys Tyr Phe Val Lys Ser Tyr Lys Val
20 25 30

Asp Ile Ser Ser Asn Gly Glu Asp Trp Ile Xaa Xaa Thr Leu Lys Gly
35 40 45

Asp Asn Lys His Leu Val Phe Thr Gly Asn Thr Asp Ala Thr Asp Val
50 55 60

Val Tyr Arg Pro Phe Ser Lys Pro Val Ile Thr Arg Phe Val Arg Leu
65 70 75 80

Arg Pro Val Thr Trp
85

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Asp Leu Ala Glu Glu Lys Ile Val Arg Gly Val Ile Ile Gln Gly Xaa
1 5 10 15

Xaa Gly Lys His Lys Glu Asn Lys Val Phe Met Arg Lys Phe Lys Ile
20 25 30

Gly Tyr Ser Asn Asn Gly Thr Glu Trp Glu Met Ile Met Asp Ser Ser
35 40 45

Lys Asn Lys Pro Lys Thr Phe Glu Gly Asn Thr Asn Tyr Asp Thr Pro
50 55 60

Glu Leu Arg Thr Phe Xaa Ala His Ile Thr Thr Gly Phe Ile Arg Ile
65 70 75 80

Ile Pro Xaa Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Gly Cys Glu Val Pro Arg Thr Phe Met Cys Val Ala Leu Gln Gly Xaa
1 5 10 15

Xaa Xaa Arg Gly Xaa Asp Ala Asp Gln Trp Val Thr Ser Tyr Lys Ile
20 25 30

Arg Tyr Ser Leu Asp Asn Val Ser Trp Phe Xaa Xaa Xaa Xaa Xaa Glu
35 40 45

Tyr Arg Asn Gly Ala Ala Ile Thr Gly Val Thr Asp Arg Asn Thr Val
50 55 60

Val Asn His Phe Phe Asp Thr Pro Ile Arg Ala Arg Ser Ile Ala Ile
65 70 75 80

His Pro Leu Thr Xaa
85



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Asp Leu Xaa Xaa Xaa Xaa Xaa Val Thr Gly Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa Phe Val Xaa Ser Tyr Lys Ile
20 25 30

Xaa Tyr Ser Xaa Asp Gly Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa
35 40 45

Xaa Xaa Lys Xaa Lys Val Phe Xaa Gly Asn Thr Asp Xaa Xaa Thr Xaa
50 55 60

Xaa Xaa Asn Xaa Phe Xaa Xaa Pro Ile Xaa Xaa Arg Phe Ile Arg Xaa
65 70 75 80

Xaa Pro Xaa Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2303 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 619..2058

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GAATTCCGGT TAACTGAGGA CAAAGGGTAA TGCAGAAGTG ATATTTGATT TCCATTCTCA 

TTCCCAGTGG CCTTGATATT TAAACTGATT CCTGCCACCA GGTCCTTGGG CCACCCTGTC 

CCTGCGTCTC ATATTTCTGC ATGCTGCTTT GTTTGTATAT AGTGCGCTCC TGGCCTCAGG 

CTCGCTCCCC TCCAGCTCTC GCTTCATTGT TCTCCAAGTC AGAAGCCCCC GCATCCGCCG 

CGCAGCAGCG TGAGCCGTAG TCACTGCTGG CCGCTTCGCC TGCGTGCGCG CACGGAAATC 

GGGGAGCCAG GAACCCAAGG AGCCGCCGTC CGCCCGCTGT GCCTCTGCTA GACCACTCGC 

AGCCCCAGCC TCTCTCAAGC GCACCCACCT CCGCGCACCC CAGCTCAGGC GAAGCTGGAG 

TGAGGGTGAA TCACCCTTTC TCTAGGGCCA CCACTCTTTT ATCGCCCTTC CCAAGATTTG 

AGAAGCGCTG CGGGAGGAAA GACGTCCTCT TGATCTCTGA CAGGGCGGGG TTTACTGCTG

TCCTGCAGGC GCGCCTCGCC TACTGTGCCC TCCGCTACGA CCCCGGACCA GCCCAGGTCA

CGTCCGTGAG AAGGGATC ATG AAG CAC TTG GTA GCA GCC TGG CTT TTG GTT
Met Lys His Leu Val Ala Ala Trp Leu Leu Val
1 5 10

GGA CTC AGC CTC GGG GTG CCC CAG TTC GGC AAA GGT GAC ATT TGC AAC
Gly Leu Ser Leu Gly Val Pro Gln Phe Gly Lys Gly Asp Ile Cys Asn
15 20 25

CCG AAC CCC TGT GAA AAT GGT GGC ATC TGT CTG TCA GGA CTG GCT GAT
Pro Asn Pro Cys Glu Asn Gly Gly Ile Cys Leu Ser Gly Leu Ala Asp
30 35 40

GAT TCC TTT TCC TGT GAG TGT CCA GAA GGC TTC GCA GGT CCG AAC TGC
Asp Ser Phe Ser Cys Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys
45 50 55

TCT AGT GTT GTG GAG GTT GCA TCA GAT GAA GAA AAG CCT ACT TCA GCA
Ser Ser Val Val Glu Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala
60 65 70 75

GGT CCC TGC ATC CCT AAC CCA TGC CAT AAC GGA GGA ACC TGT GAG ATA
Gly Pro Cys Ile Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile
80 85 90

AGC GAA GCC TAT CGA GGA GAC ACA TTC ATA GGC TAT GTT TGT AAA TGT
Ser Glu Ala Tyr Arg Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys
95 100 105

CCT CGG GGA TTT AAT GGG ATT CAC TGT CAG CAC AAT ATA AAT GAA TGT
Pro Arg Gly Phe Asn Gly Ile His Cys Gln His Asn Ile Asn Glu Cys
110 115 120

GAA GCT GAG CCT TGC AGA AAT GGC GGA ATA TGT ACC GAC CTT GTT GCT

Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser Glu Ala Tyr Arg
85 90 95

Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn
100 105 110

Gly Ile His Cys Gln His Asn Ile Asn Glu Cys Glu Ala Glu Pro Cys
115 120 125

Arg Asn Gly Gly Ile Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu
130 135 140

Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys Cys Ser Gly
145 150 155 160

His Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln Ile Thr Ala
165 170 175

Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gln Lys Trp Tyr Pro Tyr
180 185 190

Tyr Ala Arg Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp Thr Ala Ala
195 200 205

Glu Asn Asp Arg Trp Pro Trp Ile Gln Ile Asn Leu Gln Arg Lys Met
210 215 220

Arg Val Thr Gly Val Ile Thr Gln Gly Ala Lys Arg Ile Gly Ser Pro
225 230 235 240

Glu Tyr Ile Lys Ser Tyr Lys Ile Ala Tyr Ser Asn Asp Gly Lys Thr
245 250 255

Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Glu Met Val Phe Arg
260 265 270

Gly Asn Val Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro Pro
275 280 285

Ile Lys Ala Gln Tyr Val Arg Leu Tyr Pro Gln Ile Cys Arg Arg His
290 295 300

Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cys Ser
305 310 315 320

Glu Pro Leu Gly Met Lys Ser Gly His Ile Gln Asp Tyr Gln Ile Thr
325 330 335

Ala Ser Ser Val Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp Glu
340 345 350

Pro Arg Lys Ala Arg Leu Asp Lys Gln Gly Lys Val Asn Ala Trp Thr
355 360 365

Ser Gly His Asn Asp Gln Ser Gln Trp Leu Gln Val Asp Leu Leu Val
370 375 380

Pro Thr Lys Val Thr Gly Ile Ile Thr Gln Gly Ala Lys Asp Phe Gly
385 390 395 400

His Val Gln Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asn Gly
405 410 415

Glu His Trp Met Val His Gln Asp Glu Lys Gln Arg Lys Asp Lys Val
420 425 430

Phe Gln Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val Ile Asp
435 440 445

Pro Pro Ile Tyr Ala Arg Phe Ile Arg Ile Leu Pro Trp Ser Trp Tyr
450 455 460

Gly Arg Ile Thr Leu Arg Ser Glu Leu Leu Gly Cys Ala Glu Glu Glu
465 470 475 480



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1780 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1779

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

TCT CTT TAG TCA CCA CTC TCG CCC TCT CCA AGA ATT TGT TTA ACA AAG
Ser Leu * Ser Pro Leu Ser Pro Ser Pro Arg Ile Cys Leu Thr Lys
1 5 10 15

CGC TGA GGA AAG AGA ACG TCT TCT TGA ATT CTT TAG TAG GGG CGG AGT
Arg * Gly Lys Arg Thr Ser Ser * Ile Leu * * Gly Arg Ser
20 25 30

CTG CTG CTG CCC TGC GCT GCC ACC TCG GCT ACA CTG CCC TCC GCG ACG
Leu Leu Leu Pro Cys Ala Ala Thr Ser Ala Thr Leu Pro Ser Ala Thr
35 40 45

ACC CCT GAC CAG CCG GGG TCA CGT CCG GGA GAC GGG ATC ATG AAG CGC
Thr Pro Asp Gln Pro Gly Ser Arg Pro Gly Asp Gly Ile Met Lys Arg
50 55 60

TCG GTA GCC GTC TGG CTC TTG GTC GGG CTC AGC CTC GGT GTC CCC CAG
Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly Val Pro Gln
65 70 75 80

TTC GGC AAA GGT GAT ATT TGT GAT CCC AAT CCA TGT GAA AAT GGA GGT
Phe Gly Lys Gly Asp Ile Cys Asp Pro Asn Pro Cys Glu Asn Gly Gly
85 90 95

ATC TGT TTG CCA GGA TTG GCT GTA GGT TCC TTT TCC TGT GAG TGT CCA
Ile Cys Leu Pro Gly Leu Ala Val Gly Ser Phe Ser Cys Glu Cys Pro
100 105 110

GAT GGC TTC ACA GAC CCC AAC TGT TCT AGT GTT GTG GAG GTT GCA TCA
Asp Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ala Ser
115 120 125

GAT GAA GAA GAA CCA ACT TCA GCA GGT CCC TGC ACT CCT AAT CCA TGC 432
Asp Glu Glu Glu Pro Thr Ser Ala Gly Pro Cys Thr Pro Asn Pro Cys
130 135 140

CAT AAT GGA GGA ACC TGT GAA ATA AGT GAA GCA TAC CGA GGG GAT ACA 480
His Asn Gly Gly Thr Cys Glu Ile Ser Glu Ala Tyr Arg Gly Asp Thr
145 150 155 160

TTC ATA GGC TAT GTT TGT AAA TGT CCC CGA GGA TTT AAT GGG ATT CAC 528
Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn Gly Ile His
165 170 175

TGT CAG CAC AAC ATA AAT GAA TGC GAA GTT GAG CCT TGC AAA AAT GGT 576
Cys Gln His Asn Ile Asn Glu Cys Glu Val Glu Pro Cys Lys Asn Gly
180 185 190

GGA ATA TGT ACA GAT CTT GTT GCT AAC TAT TCC TGT GAG TGC CCA GGC 624
Gly Ile Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu Cys Pro Gly
195 200 205

GAA TTT ATG GGA AGA AAT TGT CAA TAC AAA TGC TCA GGC CCA CTG GGA 672
Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys Cys Ser Gly Pro Leu Gly
210 215 220

ATT GAA GGT GGA ATT ATA TCA AAC CAG CAA ATC ACA GCT TCC TCT ACT 720
Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln Ile Thr Ala Ser Ser Thr
225 230 235 240

CAC CGA GCT CTT TTT GGA CTC CAA AAA TGG TAT CCC TAC TAT GCA CGT 768
His Arg Ala Leu Phe Gly Leu Gln Lys Trp Tyr Pro Tyr Tyr Ala Arg
245 250 255

CTT AAT AAG AAG GGG CTT ATA AAT GCG TGG ACA GCT GCA GAA AAT GAC 816
Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp Thr Ala Ala Glu Asn Asp
260 265 270

AGA TGG AAG CGG TGG ATT CAG ATA AAT TTG CAA AGA AAA ATG AGA GTT 864
Arg Trp Lys Arg Trp Ile Gln Ile Asn Leu Gln Arg Lys Met Arg Val
275 280 285

ACT GGT GTG ATT ACC CAA GGG GCC AAG AGG ATT GGA AGC CCA GAG TAT 912
Thr Gly Val Ile Thr Gln Gly Ala Lys Arg Ile Gly Ser Pro Glu Tyr
290 295 300

ATA AAA TTC TAC AAA ATT GCC TAC AGT AAT GAT GGA AAG ACT TGG GCA 960
Ile Lys Phe Tyr Lys Ile Ala Tyr Ser Asn Asp Gly Lys Thr Trp Ala
305 310 315 320

ATG TAC AAA GTG AAA GGC ACC AAT GAA GAC ATG GTG TTT CGT GGA AAC 1008
Met Tyr Lys Val Lys Gly Thr Asn Glu Asp Met Val Phe Arg Gly Asn
325 330 335

ATT GAT AAC AAC ACT CCA TAT GCT AAC TCT TTC ACA CCC CCC ATA AAA 1056
Ile Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro Pro Ile Lys
340 345 350

GCT CAG TAT GTA AGA CTC TAT CCC CAA GTT TGT CGA AGA CAT TGC ACT 1104
Ala Gln Tyr Val Arg Leu Tyr Pro Gln Val Cys Arg Arg His Cys Thr
355 360 365

TTG CGA ATG GAA CTT CTT GGC TGT GAA CTG TCG GGT TGT TCT GAG CCT 1152
Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cys Ser Glu Pro
370 375 380

CTG GGT ATG AAA TCA GGA CAT ATA CAA GAC TAT CAG ATC ACT GCC TCC
Leu Gly Met Lys Ser Gly His Ile Gln Asp Tyr Gln Ile Thr Ala Ser
385 390 395 400

AGC ATC TTC AGA ACG CTC AAC ATG GAC ATG TTC ACT TGG GAA CCA AGG
Ser Ile Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp Glu Pro Arg
405 410 415

AAA GCT CGG CTG GAC AAG CAA GGC AAA GTG AAT GCC TGG ACC TCT GGC
Lys Ala Arg Leu Asp Lys Gln Gly Lys Val Asn Ala Trp Thr Ser Gly
420 425 430

CAC AAT GAC CAG TCA CAA TGG TTA CAG GTG GAT CTT CTT GTT CCA ACC
His Asn Asp Gln Ser Gln Trp Leu Gln Val Asp Leu Leu Val Pro Thr
435 440 445

AAA GTG ACT GGC ATC ATT ACA CAA GGA GCT AAA GAT TTT GGT CAT GTA
Lys Val Thr Gly Ile Ile Thr Gln Gly Ala Lys Asp Phe Gly His Val
450 455 460

CAG TTT GTT GGC TCC TAC AAA CTG GCT TAC AGC AAT GAT GGA GAA CAC
Gln Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asp Gly Glu His
465 470 475 480

TGG ACT GTA TAC CAG GAT GAA AAG CAA AGA AAA GAT AAG GTT TTC CAG
Trp Thr Val Tyr Gln Asp Glu Lys Gln Arg Lys Asp Lys Val Phe Gln
485 490 495

GGA AAT TTT GAC AAT GAC ACT CAC AGA AAA AAT GTC ATC GAC CCT CCC
Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val Ile Asp Pro Pro
500 505 510

ATC TAT GCA CGA CAC ATA AGA ATC CTT CCT TGG TCC TGG TAC GGG AGG
Ile Tyr Ala Arg His Ile Arg Ile Leu Pro Trp Ser Trp Tyr Gly Arg
515 520 525

ATC ACA TTG GCG TCA GAG CTG CTG GGC TGC ACA GAG GAG GAA TGA GGG
Ile Thr Leu Ala Ser Glu Leu Leu Gly Cys Thr Glu Glu Glu * Gly
530 535 540

GAG GCT ACA TTT CAC AAC CGT CTT CCC TAT TTG GGT AAA AGT ATC TCC
Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser Ile Ser
545 550 555 560

ATG GAA TGA ACT GTG TAA AAT CTG TAG GAA ACT GAA TGG TTT TTT TTT
Met Glu * Thr Val * Asn Leu * Glu Thr Glu Trp Phe Phe Phe
565 570 575

TTT TCA TGA AAA AGT GGT CAA ATT ATG GTA GGC AAC TAA CGG TGT TTT
Phe Ser * Lys Ser Gly Gln Ile Met Val Gly Asn * Arg Cys Phe
580 585 590

TAC C
Tyr



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Ser Leu * Ser Pro Leu Ser Pro Ser Pro Arg Ile Cys Leu Thr Lys
1 5 10 15

Arg 

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Gly Lys Arg Thr Ser Ser
1 5

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 517 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Ile Leu * * Gly Arg Ser Leu Leu Leu Pro Cys Ala Ala Thr Ser
1 5 10 15

Ala Thr Leu Pro Ser Ala Thr Thr Pro Asp Gln Pro Gly Ser Arg Pro
20 25 30

Gly Asp Gly Ile Met Lys Arg Ser Val Ala Val Trp Leu Leu Val Gly
35 40 45

Leu Ser Leu Gly Val Pro Gln Phe Gly Lys Gly Asp Ile Cys Asp Pro
50 55 60

Asn Pro Cys Glu Asn Gly Gly Ile Cys Leu Pro Gly Leu Ala Val Gly
65 70 75 80

Ser Phe Ser Cys Glu Cys Pro Asp Gly Phe Thr Asp Pro Asn Cys Ser
85 90 95

Ser Val Val Glu Val Ala Ser Asp Glu Glu Glu Pro Thr Ser Ala Gly
100 105 110

Pro Cys Thr Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser
115 120 125

Glu Ala Tyr Arg Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro
130 135 140

Arg Gly Phe Asn Gly Ile His Cys Gln His Asn Ile Asn Glu Cys Glu
145 150 155 160

Val Glu Pro Cys Lys Asn Gly Gly Ile Cys Thr Asp Leu Val Ala Asn
165 170 175

Tyr Ser Cys Glu Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gln Tyr
180 185 190

Lys Cys Ser Gly Pro Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln
195 200 205

Gln Ile Thr Ala Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gln Lys
210 215 220

Trp Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu Ile Asn Ala
225 230 235 240

Trp Thr Ala Ala Glu Asn Asp Arg Trp Lys Arg Trp Ile Gln Ile Asn
245 250 255

Leu Gln Arg Lys Met Arg Val Thr Gly Val Ile Thr Gln Gly Ala Lys
260 265 270

Arg Ile Gly Ser Pro Glu Tyr Ile Lys Phe Tyr Lys Ile Ala Tyr Ser
275 280 285

Asn Asp Gly Lys Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu
290 295 300

Asp Met Val Phe Arg Gly Asn Ile Asp Asn Asn Thr Pro Tyr Ala Asn
305 310 315 320

Ser Phe Thr Pro Pro Ile Lys Ala Gln Tyr Val Arg Leu Tyr Pro Gln
325 330 335

Val Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu
340 345 350

Leu Ser Gly Cys Ser Glu Pro Leu Gly Met Lys Ser Gly His Ile Gln
355 360 365

Asp Tyr Gln Ile Thr Ala Ser Ser Ile Phe Arg Thr Leu Asn Met Asp
370 375 380

Met Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Asp Lys Gln Gly Lys
385 390 395 400

Val Asn Ala Trp Thr Ser Gly His Asn Asp Gln Ser Gln Trp Leu Gln
405 410 415

Val Asp Leu Leu Val Pro Thr Lys Val Thr Gly Ile Ile Thr Gln Gly
420 425 430

Ala Lys Asp Phe Gly His Val Gln Phe Val Gly Ser Tyr Lys Leu Ala
435 440 445

Tyr Ser Asn Asp Gly Glu His Trp Thr Val Tyr Gln Asp Glu Lys Gln
450 455 460

Arg Lys Asp Lys Val Phe Gln Gly Asn Phe Asp Asn Asp Thr His Arg
465 470 475 480

Lys Asn Val Ile Asp Pro Pro Ile Tyr Ala Arg His Ile Arg Ile Leu
485 490 495

Pro Trp Ser Trp Tyr Gly Arg Ile Thr Leu Ala Ser Glu Leu Leu Gly
500 505 510

Cys Thr Glu Glu Glu
515

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser Ile
1 5 10 15 

Ser Met Glu * Thr Val * Asn Leu 

20 25 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Glu Thr Glu Trp Phe Phe Phe Phe Ser
1 5



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Lys Ser Gly Gln Ile Met Val Gly Asn
1 5



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Arg Cys Phe Tyr 
1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

GACAGATGGC CATGGATTCA GATAAATTTG CAAAGAAAAA TGAGAGTCAC TGGTGTTATT 60

ACCCAAGGAG CAAAAAGGAT TGGAAGCCCA GAGTACATAA AATCCTACAA AATTGCCTAC 120

AGCAATGACG GGAAGACCTG GGCAATGTAC AAAGTAAAAG GCACCAATGA AGAGATGGTC 180

TTTCGTGGAA ATGTTGATAA CAACACACCA TATGCTAATT CTTTCACACC CCCAATCAAA 240

GCTCAGTATG TAAGACTCTA CCCCCAAATT TGTCGAAGGC ATTGTACTTT AAGAATGGAA 300

CTTCTTGGCT GTGAGCTC 318

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 320 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Cys Ser Thr Gln Leu Gly Met Glu Gly Gly Ala Ile Ala Asp Ser Gln
1 5 10 15

Ile Ser Ala Ser Tyr Val Tyr Met Gly Phe Met Gly Leu Gln Arg Trp
20 25 30

Gly Pro Glu Leu Ala Arg Leu Tyr Arg Thr Gly Ile Val Asn Ala Trp
35 40 45

His Ala Ser Asn Tyr Asp Xaa Ser Lys Pro Trp Ile Gln Val Asn Leu
50 55 60

Leu Arg Lys Met Arg Val Ser Gly Val Met Thr Gln Gly Ala Ser Arg
65 70 75 80

Ala Gly Arg Ala Glu Tyr Leu Lys Thr Phe Lys Val Ala Tyr Ser Leu
85 90 95

Asp Gly Xaa Arg Lys Phe Glu Phe Ile Gln Asp Glu Ser Gly Gly Asp
100 105 110

Lys Glu Phe Leu Gly Asn Leu Asp Asn Asn Ser Leu Lys Val Asn Met
115 120 125

Phe Asn Pro Thr Leu Glu Ala Gln Tyr Ile Arg Leu Tyr Pro Val Ser
130 135 140

Cys His Arg Gly Cys Thr Leu Arg Phe Glu Leu Leu Gly Cys Glu Leu
145 150 155 160

His Gly Cys Leu Glu Pro Leu Gly Leu Lys Asn Asn Thr Ile Pro Asp
165 170 175

Ser Gln Met Ser Ala Ser Ser Ser Tyr Lys Thr Trp Asn Leu Arg Ala
180 185 190

Phe Gly Trp Tyr Pro His Leu Gly Arg Leu Asp Asn Gln Gly Lys Ile
195 200 205

Asn Ala Trp Thr Ala Gln Ser Asn Ser Ala Lys Glu Trp Leu Gln Val
210 215 220

Asp Leu Gly Thr Gln Arg Gln Val Thr Gly Ile Ile Thr Gln Gly Ala
225 230 235 240

Arg Asp Phe Gly His Ile Gln Tyr Val Glu Ser Tyr Lys Val Ala His
245 250 255

Ser Asp Asp Gly Val Gln Trp Thr Val Tyr Xaa Xaa Glu Glu Gln Gly
260 265 270

Ser Ser Lys Val Phe Gln Gly Asn Leu Asp Asn Asn Ser His Lys Lys
275 280 285

Asn Ile Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arg Val Leu Pro
290 295 300

Val Ser Trp His Asn Arg Ile Thr Leu Arg Leu Glu Leu Leu Gly Cys
305 310 315 320



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 321 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Cys Ser Gly Pro Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln
1 5 10 15

Ile Thr Ala Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gln Lys Trp
20 25 30

Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp
35 40 45

Thr Ala Ala Glu Asn Asp Arg Trp Asn Arg Trp Ile Gln Ile Asn Leu
50 55 60

Gln Arg Lys Met Arg Val Thr Gly Val Ile Thr Gln Gly Ala Lys Arg
65 70 75 80

Ile Gly Ser Pro Glu Tyr Ile Lys Phe Tyr Lys Ile Ala Tyr Ser Asn
85 90 95

Asp Gly Lys Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Asp
100 105 110

Met Val Phe Arg Gly Asn Ile Asp Asn Asn Thr Pro Tyr Ala Asn Ser
115 120 125

Phe Thr Pro Pro Ile Lys Ala Gln Tyr Val Arg Leu Tyr Pro Gln Val
130 135 140

Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu
145 150 155 160

Ser Gly Cys Ser Glu Pro Leu Gly Met Lys Ser Gly His Ile Gln Asp
165 170 175

Tyr Gln Ile Thr Ala Ser Ser Ile Phe Arg Thr Leu Asn Met Asp Met
180 185 190

Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Asp Lys Gln Gly Lys Val
195 200 205

Asn Ala Trp Thr Ser Gly His Asn Asp Gln Ser Gln Trp Leu Gln Val
210 215 220

Xaa Leu Leu Val Pro Thr Lys Val Thr Gly Ile Ile Thr Gln Gly Ala
225 230 235 240

Lys Asp Xaa Gly His Val Gln Phe Val Gly Ser Tyr Lys Leu Ala Tyr
245 250 255

Ser Asn Asp Gly Glu His Trp Thr Val Xaa Gln Asp Glu Lys Gln Arg
260 265 270

Lys Asp Lys Val Xaa Gln Gly Asn Phe Asp Asn Asp Thr His Arg Lys
275 280 285

Asn Val Ile Asp Pro Pro Ile Tyr Ala Arg His Ile Arg Ile Leu Pro
290 295 300

Trp Ser Trp Tyr Gly Arg Ile Thr Leu Ala Ser Glu Leu Leu Gly Cys
305 310 315 320

Thr



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Met Lys Arg Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly
1 5 10 15

Val Pro Gln Phe Gly Lys Gly Asp Ile
20 25

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 57 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Cys Asp Pro Asn Pro Cys Glu Asn Gly Gly Ile Cys Leu Pro Gly Leu
1 5 10 15

Ala Val Gly Xaa Xaa Xaa Xaa Xaa Ser Phe Ser Cys Glu Cys Pro Asp
20 25 30

Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ala Ser Asp
35 40 45

Glu Glu Glu Pro Thr Ser Ala Gly Pro
50 55



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Cys Thr Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser Glu
1 5 10 15

Ala Tyr Arg Gly Asn Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg
20 25 30

Gly Phe Asn Gly Ile His Cys Gln His Asn Ile
35 40



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 42 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Cys Glu Val Glu Pro Cys Lys Asn Gly Gly Ile Cys Thr Asp Leu Val
1 5 10 15

Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Tyr Ser Cys Glu Cys Pro Gly
20 25 30

Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys
35 40



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Cys Xaa Xaa Xaa Pro Cys Xaa Asn Gly Gly Xaa Cys Xaa Xaa Xaa Xaa
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Cys Xaa Cys Xaa Xaa
20 25 30

Gly Tyr Xaa Gly Xaa Xaa Cys Xaa
35 40

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 310 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..309

GACACATTCA TAGGCTATGT TTGTAAATGT CCTCGGGGAT TTAATGGGAT TCACTGTCAG 900

CACAATATAA ATGAATGTGA AGCTGAGCCT TGCAGAAATG GCGGAATATG TACCGACCTT 960

GTTGCTAACT ACTCTTGTGA ATGCCCAGGA GAATTTATGG GACGAAATTG TCAATATAAA 1020

TGCTCTGGGC ACTTGGGAAT CGAAGGTGGG ATCATATCTA ATCAGCAAAT CACAGCTTCA 1080

TCTAATCACC GAGCTCTTTT TGGACTCCAG AAGTGGTATC CCTACTATGC TAGACTTAAT 1140

AAGAAGGGCC TTATAAATGC CTGGACAGCT GCTGAAAATG ACAGATGGCC ATGGATTCAG 1200

GTAACAGTGG GATGAGACAA ATCCATTTCC CAAATTATCA GAATCATTAT AGAAGTAGGT 1260

TAGGGAGAAT TGGCTGTGAT TCTTTCTCAT GGTTAAAATG TGATTTAGTT CAGAATTAAC 1320

ATGGTTGGAA ACTCTAAAAA ATGTGGAAAA CAGGAACATT CTATGTCTGA AAATCTGAAA 1380

ATAGCATCAA GATGAAAACA TTCTTTAGTC ATAAATATAC TCTTTTAAGT TATAGTAGAG 1440

AAAAAGATCT TATCATTTCA TAAGTGGACT TTTGGGATAG CATTGGAAAT GTAAATGAAA 1500

TAAATACCTA ATTGAAAAAA GTTTATTCTA AAGTGTTAAT ATTTAGCAAC AGATTCAGAG 1560

ACAAGAAAGT AACAATTCAA TCTGTGTATT TTTTGTGAGA AATAGTTTCC CATGTGCAAA 1620

TATAAAGTGC GCATCATATC ATGATAATAT CCAACTGTCT GCAGAACTCC CTTTCATAAA 1680

TGAGAGAATT TTAATTCATA GTGCCTTATA TCCTCATCAG CCATCTGACT TTACTACAGA 1740

AGAAAACAAT GAAATGATGC ATTAAGTGCT TTGCTAGAAG AAACATCATA GCAAAGCTGA 1800

TAGCCCACAT TCTGTGCANN NAAGCTTCCA GAGCACTCGA GAAAAAGCAG AAATGAGATG 1860

TTTTATGAAA ACCGAAAAGA TAATCTGATT TCTGTGAAAT ATACTTTTGA TCATGTGGTT 1920

CTTTAAGATA GTCACTAACA AGTCATTAGT AGCAGATACC AAATGGGAGA AAATTTCCAG 1980

TATACTGAGG GTCAAGGCAG TCATGCTGAA ACTACATGAG GTCAGGAAAG TTTTGAAATA 2040

AGGTGATTTT GGAAGGATAC CTTCAACTGG CCTAGATTTT CAAGAAACAG TGTAATCAAC 2100

AGCCAAACAT GAGAATCTAG CTAACAGCAT TTAGAAAACC AGAACTAAGA GTGTTACTGG 2160

GGAATTGCAT TTAAATCCAG TATGAGAGTT TGCAAATGCC GTATTCTTCT AAGGGGTTTG 2220

TGCCACATTT TGTTACCATG GAGTCCTCTG TAAGAACTTT ATTAGATAAA TCATCTTTAC 2280

ACTATAATTT GAATAAAAGC CGGAATTC 2303



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 480 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Met Lys Arg Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly
1 5 10 15

Val Pro Gln Phe Gly Lys Gly Asp Ile Cys Asp Pro Asn Pro Cys Glu
20 25 30

Asn Gly Gly Ile Cys Leu Pro Gly Leu Ala Val Gly Ser Phe Ser Cys
35 40 45

Glu Cys Pro Asp Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu
50 55 60

Val Ala Ser Asp Glu Glu Glu Pro Thr Ser Ala Gly Pro Cys Thr Pro
65 70 75 80

Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser Glu Ala Tyr Arg
85 90 95

Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn
100 105 110

Gly Ile His Cys Gln His Asn Ile Asn Glu Cys Glu Val Glu Pro Cys
115 120 125

Lys Asn Gly Gly Ile Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu
130 135 140

Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys Cys Ser Gly
145 150 155 160

Pro Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln Ile Thr Ala
165 170 175

Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gln Lys Trp Tyr Pro Tyr
180 185 190

Tyr Ala Arg Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp Thr Ala Ala
195 200 205

Glu Asn Asp Arg Trp Lys Arg Trp Ile Gln Ile Asn Leu Gln Arg Lys
210 215 220

Met Arg Val Thr Gly Val Ile Thr Gln Gly Ala Lys Arg Ile Gly Ser
225 230 235 240

Pro Glu Tyr Ile Lys Phe Tyr Lys Ile Ala Tyr Ser Asn Asp Gly Lys
245 250 255

Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Asp Met Val Phe
260 265 270

Arg Gly Asn Ile Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro
275 280 285

Pro Ile Lys Ala Gln Tyr Val Arg Leu Tyr Pro Gln Val Cys Arg Arg
290 295 300

His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cys
305 310 315 320

Ser Glu Pro Leu Gly Met Lys Ser Gly His Ile Gln Asp Tyr Gln Ile
325 330 335

Thr Ala Ser Ser Ile Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp
340 345 350



Glu Pro Arg Lvs Ala 
355 

Thr Ser Gly His Asn 
370 



Val Pro Thr Lys Val 
385 

Gly His Val Gin Phe 

405 

Gly Glu His Tro Thr 

420 

Val Xaa Gin Glv Asn 
43 5 

Asp Pro Pro lie Tvr 
450 

Tyr Glv Arg lie Thr 
465 

Glu 
480 



Arg Leu Aso Lvs Gin Glv 
360 

Aso Gin Ser Gin Tro Leu 
37 5 

Thr Gly He He Thr Gin 
390 395 

Val Gly Ser Tvr Lys Leu 

410 

Val Xaa Gin Aso Glu Lys 

425 

Phe Asp Asn Aso Thr His 
440 

Ala Arg His He Arg He 
455 

Leu Ala Ser Glu Leu Leu 
470 475 



Lys Val Asn Ala Trp 
365 

Gin Val Xaa Leu Leu 
330 

Gly Ala Lys Aso Xaa 

400 

Ala Tyr Ser Asn Aso 

415 

Gin Arg Lvs Aso Lvs 
430 

Arg Lys Asn Val He 
445 

Leu Pro Tro Ser Tro 
4 60 

Gly Cys Thr Glu Glu 



SEQUENCE LISTING 



(1) GENERAL INFORMATION:



(i) APPLICANT: Quertermous, Thomas
Hogan, Brigid 
Snodgrass, H. Ralph 
Zupancic, Thomas J. 

(ii) TITLE OF INVENTION: DEVELOPMENTALLY-REGULATED ENDOTHELIAL
CELL LOCUS-1

(iii) NUMBER OF SEQUENCES:

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Pennie & Edmonds LLP

(B) STREET: 1155 Avenue of the Americas

(C) CITY: New York

(D) STATE: New York

(E) COUNTRY: United States

(F) ZIP: 10036-2711

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/659, N 

(B) FILING DATE: 05-JUN-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Poissant, Brian M. 

(B) REGISTRATION NUMBER: 28,462 

(C) REFERENCE/DOCKET NUMBER: 8907-0034 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-8864/9741 

(C) TELEX: 66141 Pennie 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Asp Leu Leu Val Pro Thr Lys Val Thr Gly Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Ala Lys Asp Phe Gly Asp Val Leu Phe Val Gly Ser Tyr Lys Leu
20 25 30

Ala Tyr Ser Asn Asp Gly Glu His Trp Met Val His Gln Asp Glu Lys
35 40 45

Gln Arg Lys Asp Lys Val Phe Gln Gly Asn Phe Asp Asn Asp Thr His
50 55 60

Arg Lys Asn Val Ile Asp Pro Pro Ile Tyr Ala Arg Phe Ile Arg Ile
65 70 75 80

Leu Pro Leu Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Asp Leu Gly Ser Ser Lys Glu Val Thr Gly Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Ala Arg Asn Phe Gly Ser Val Gln Phe Val Ala Ser Tyr Lys Val
20 25 30

Ala Tyr Ser Asn Asp Ser Ala Asn Trp Thr Glu Tyr Gln Asp Pro Arg
35 40 45

Thr Gly Ser Ser Lys Val Phe Gln Gly Asn Leu Asp Asn Asn Ser His
50 55 60

Lys Lys Asn Ile Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arg Val
65 70 75 80

Leu Pro Val Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Asp Leu Leu Lys Ile Lys Lys Ile Thr Ala Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Cys Lys Ser Leu Ser Ser Glu Met Tyr Val Lys Ser Tyr Thr Ile
20 25 30

His Tyr Ser Glu Gln Gly Val Glu Trp Lys Pro Tyr Arg Leu Lys Ser
35 40 45

Ser Met Val Asp Lys Ile Phe Glu Gly Asn Thr Asn Thr Lys Gly His
50 55 60

Val Lys Asn Phe Phe Asn Pro Pro Ile Ile Ser Arg Phe Ile Arg Val
65 70 75 80

Ile Pro Lys Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Asp Leu Gln Lys Thr Met Lys Val Thr Gly Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Val Lys Ser Leu Phe Thr Ser Met Phe Val Lys Glu Phe Leu Ile
20 25 30

Ser Ser Ser Gln Asp Gly His His Trp Thr Xaa Xaa Gln Ile Leu Tyr
35 40 45

Asn Gly Lys Val Lys Val Phe Gln Gly Asn Gln Asp Ser Ser Thr Pro
50 55 60

Met Met Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Xaa Xaa Xaa Xaa
65 70 75 80

Xaa Xaa Xaa Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Asp Leu Glu Asn Leu Arg Phe Val Ser Gly Ile Gly Thr Gln Gly Ala
1 5 10 15

Ile Ser Lys Glu Thr Lys Lys Lys Tyr Phe Val Lys Ser Tyr Lys Val
20 25 30

Asp Ile Ser Ser Asn Gly Glu Asp Trp Ile Xaa Xaa Thr Leu Lys Gly
35 40 45

Asp Asn Lys His Leu Val Phe Thr Gly Asn Thr Asp Ala Thr Asp Val
50 55 60

Val Tyr Arg Pro Phe Ser Lys Pro Val Ile Thr Arg Phe Val Arg Leu
65 70 75 80

Arg Pro Val Thr Trp
85

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Asp Leu Ala Glu Glu Lys Ile Val Arg Gly Val Ile Ile Gln Gly Xaa
1 5 10 15

Xaa Gly Lys His Lys Glu Asn Lys Val Phe Met Arg Lys Phe Lys Ile
20 25 30

Gly Tyr Ser Asn Asn Gly Thr Glu Trp Glu Met Ile Met Asp Ser Ser
35 40 45

Lys Asn Lys Pro Lys Thr Phe Glu Gly Asn Thr Asn Tyr Asp Thr Pro
50 55 60

Glu Leu Arg Thr Phe Xaa Ala His Ile Thr Thr Gly Phe Ile Arg Ile
65 70 75 80

Ile Pro Xaa Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Gly Cys Glu Val Pro Arg Thr Phe Met Cys Val Ala Leu Gln Gly Xaa
1 5 10 15

Xaa Xaa Arg Gly Xaa Asp Ala Asp Gln Trp Val Thr Ser Tyr Lys Ile
20 25 30

Arg Tyr Ser Leu Asp Asn Val Ser Trp Phe Xaa Xaa Xaa Xaa Xaa Glu
35 40 45

Tyr Arg Asn Gly Ala Ala Ile Thr Gly Val Thr Asp Arg Asn Thr Val
50 55 60

Val Asn His Phe Phe Asp Thr Pro Ile Arg Ala Arg Ser Ile Ala Ile
65 70 75 80

His Pro Leu Thr Xaa
85

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Asp Leu Xaa Xaa Xaa Xaa Xaa Val Thr Gly Ile Ile Thr Gln Gly Xaa
1 5 10 15

Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa Phe Val Xaa Ser Tyr Lys Ile
20 25 30

Xaa Tyr Ser Xaa Asp Gly Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa
35 40 45

Xaa Xaa Lys Xaa Lys Val Phe Xaa Gly Asn Thr Asp Xaa Xaa Thr Xaa
50 55 60

Xaa Xaa Asn Xaa Phe Xaa Xaa Pro Ile Xaa Xaa Arg Phe Ile Arg Xaa
65 70 75 80

Xaa Pro Xaa Xaa Xaa
85

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2303 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 619.. 2058 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAATTCCGGT TAACTGAGGA CAAAGGGTAA TGCAGAAGTG ATATTTGATT TCCATTCTCA 

TTCCCAGTGG CCTTGATATT TAAACTGATT CCTGCCACCA GGTCCTTGGG CCACCCTGTC 

CCTGCGTCTC ATATTTCTGC ATGCTGCTTT GTTTGTATAT AGTGCGCTCC TGGCCTCAGG 

CTCGCTCCCC TCCAGCTCTC GCTTCATTGT TCTCCAAGTC AGAAGCCCCC GCATCCGCCG 

CGCAGCAGCG TGAGCCGTAG TCACTGCTGG CCGCTTCGCC TGCGTGCGCG CACGGAAATC 

GGGGAGCCAG GAACCCAAGG AGCCGCCGTC CGCCCGCTGT GCCTCTGCTA GACCACTCGC

AGCCCCAGCC TCTCTCAAGC GCACCCACCT CCGCGCACCC CAGCTCAGGC GAAGCTGGAG 

TGAGGGTGAA TCACCCTTTC TCTAGGGCCA CCACTCTTTT ATCGCCCTTC CCAAGATTTG 

AGAAGCGCTG CGGGAGGAAA GACGTCCTCT TGATCTCTGA CAGGGCGGGG TTTACTGCTG 

TCCTGCAGGC GCGCCTCGCC TACTGTGCCC TCCGCTACGA CCCCGGACCA GCCCAGGTCA 

CGTCCGTGAG AAGGGATC ATG AAG CAC TTG GTA GCA GCC TGG CTT TTG GTT
Met Lys His Leu Val Ala Ala Trp Leu Leu Val
1 5 10

GGA CTC AGC CTC GGG GTG CCC CAG TTC GGC AAA GGT GAC ATT TGC AAC 
Gly Leu Ser Leu Gly Val Pro Gin Phe Gly Lys Gly Asp lie Cys Asn 

15 20 25 

CCG AAC CCC TGT GAA AAT GGT GGC ATC TGT CTG TCA GGA CTG GCT GAT 
Pro Asn Pro Cys Glu Asn Gly Gly lie Cys Leu Ser Gly Leu Ala Asp 
30 35 40 

GAT TCC TTT TCC TGT GAG TGT CCA GAA GGC TTC GCA GGT CCG AAC TGC 
Asp Ser Phe Ser Cys Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys 
45 ^ 50 55 

TCT AGT GTT GTG GAG GTT GCA TCA GAT GAA GAA AAG CCT ACT TCA GCA 
Ser Ser Val Val Glu Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala 
60 65 70 75 

GGT CCC TGC ATC CCT AAC CCA TGC CAT AAC GGA GGA ACC TGT GAG ATA 
Gly Pro Cys lie Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu lie 

80 85 ' ** 90 

AGC GAA GCC TAT CGA GGA GAC ACA TTC ATA GGC TAT GTT TGT AAA TGT 
Ser Glu Ala Tyr Arg Gly Asp Thr Phe lie Gly Tyr Val Cys Lys Cys 

95 ~ 100 105 

CCT CGG GGA TTT AAT GGG ATT CAC TGT CAG CAC AAT ATA AAT GAA TGT 
Pro Arg Gly Phe Asn Gly lie His Cys Gin His Asn lie Asn Glu Cys 
110 115 120 

GAA GCT GAG CCT TGC AGA AAT GGC GGA ATA TGT ACC GAC CTT GTT GCT
Glu Ala Glu Pro Cys Arg Asn Gly Gly Ile Cys Thr Asp Leu Val Ala
125 130 135



AAC TAC TCT TGT GAA TGC CCA GGA GAA TTT ATG GGA CGA AAT TGT CAA 
Asn Tyr Ser Cys Glu Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gin 
140 " 145 150 155 

TAT AAA TGC TCT GGG CAC TTG GGA ATC GAA GGT GGG ATC ATA TCT AAT
Tyr Lys Cys Ser Gly His Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn

160 165 170 

CAG CAA ATC ACA GCT TCA TCT AAT CAC CGA GCT CTT TTT GGA CTC GAG 
Gin Gin He Thr Ala Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gin 

175 180 185 

AAG TGG TAT CCC TAC TAT GCT CGA CTT AAT AAG AAG GGC CTT ATA AAT 
Lys Trp Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu He Asn 
190 * 195 200 

GCC TGG ACA GCT GCT GAA AAT GAC AGA TGG CCA TGG ATT CAG ATA AAT 
Ala Trp Thr Ala Ala Glu Asn Asp Arg Trp Pro Trp He Gin He Asn 
205 210 215 

TTG CAA AGA AAA ATG AGA GTC ACT GGT GTT ATT ACC CAA GGA GCA AAA 
Leu Gin Arg Lys Met Arg Val Thr Gly Val He Thr Gin Gly Ala Lys 
220 " ^ 225 * 230 235 

AGG ATT GGA AGC CCA GAG TAC ATA AAA TCC TAC AAA ATT GCC TAC AGC 
Arg He Gly Ser Pro Glu Tyr He Lys Ser Tyr Lys He Ala Tyr Ser 

240 ^ 245 250 

AAT GAC GGG AAG ACC TGG GCA ATG TAC AAA GTA AAA GGC ACC AAT GAA 
Asn Asp Gly Lys Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu 

255 260 265 

GAG ATG GTC TTT CGT GGA AAT GTT GAT AAC AAC ACA CCA TAT GCT AAT 
Glu Met Val Phe Arg Gly Asn Val Asp Asn Asn Thr Pro Tyr Ala Asn 
270 ~ 275 280 

TCT TTC ACA CCC CCA ATC AAA GCT CAG TAT GTA AGA CTC TAC CCC CAA 
Ser Phe Thr Pro Pro He Lys Ala Gin Tyr Val Arg Leu Tyr Pro Gin 
285 290 295 

ATT TGT CGA AGG CAT TGT ACT TTA AGA ATG GAA CTT CTT GGC TGT GAG 
He Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu 
300 305 " 310 315 

CTC TCA GGC TGT TCA GAA CCT TTG GGG ATG AAA TCA GGG CAT ATA CAA 
Leu Ser Gly Cys Ser Glu Pro Leu Gly Met Lys Ser Gly His He Gin 

320 325 330 

GAC TAC CAG ATC ACT GCC TCC AGC GTC TTC AGA ACA CTC AAC ATG GAC 
Asp Tyr Gin He Thr Ala Ser Ser Val Phe Arg Thr Leu Asn Met Asp 

335 340 345 

ATG TTT ACT TGG GAA CCA AGG AAA GCC AGG CTG GAC AAG CAA GGC AAA 
Met Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Asp Lys Gin Gly Lys 
350 355 360 

GTA AAT GCC TGG ACT TCC GGC CAT AAC GAC CAG TCA CAA TGG TTA CAG 
Val Asn Ala Trp Thr Ser Gly His Asn Asp Gin Ser Gin Trp Leu Gin 
365 370 375 



GTT GAT CTT CTT GTC CCT ACT AAG GTG ACA GGC ATC ATT ACA CAA GGA 
Val Asp Leu Leu Val Pro Thr Lys Val Thr Gly lie lie Thr Gin Gly 
380 385 390 395 

GCT AAA GAT TTT GGT CAC GTG CAG TTT GTT GGG TCA TAC AAA CTA GCT
Ala Lys Asp Phe Gly His Val Gln Phe Val Gly Ser Tyr Lys Leu Ala

400 405 410 

TAC AGC AAT GAT GGA GAA CAC TGG ATG GTG CAC CAG GAT GAA AAA CAG 
Tyr Ser Asn Asp Gly Glu His Trp Met Val His Gin Asp Glu Lys Gin 

415 420 425 

AGG AAA GAC AAG GTT TTT CAA GGC AAT TTT GAC AAT GAC ACT CAC AGG 
Arg Lys Asp Lys Val Phe Gin Gly Asn Phe Asp Asn Asp Thr His Arg 
430 435 440 

AAA AAT GTC ATC GAC CCT CCC ATC TAT GCA CGA TTC ATA AGA ATC CTT 
Lys Asn Val lie Asp Pro Pro lie Tyr Ala Arg Phe lie Arg lie Leu 
445 450 455 

CCT TGG TCC TGG TAT GGA AGG ATC ACT CTG CGG TCA GAG CTG CTG GGC 
Pro Trp Ser Trp Tyr Gly Arg lie Thr Leu Arg Ser Glu Leu Leu Gly 
460 * 465 470 475 

TGC GCA GAG GAG GAA TGAAGTGCGG GGCCGCACAT CCCACAATGC TTTTCTTTAT 
Cys Ala Glu Glu Glu 

480 

TTTCCTATAA GTATCTCCAC GAAATGAACT GTGTGAAGCT GATGGAAACT GCATTTGTTT 
TTTTCAAAGT GTTCAAATTA TGGTAGGCTA CTGACTGTCT TTTTAGGAGT TCTAAGCTTG 
CCTTTTTAAT AATTTAATTT GGTTTCCTTT GCTCAACTCT CTTATGTAAT ATCACACTGT 
CTGTGAGTTA CTCTTCTTGT TCTCT 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys His Leu Val Ala Ala Trp Leu Leu Val Gly Leu Ser Leu Gly
1 5 10 15

Val Pro Gln Phe Gly Lys Gly Asp Ile Cys Asn Pro Asn Pro Cys Glu
20 25 30

Asn Gly Gly Ile Cys Leu Ser Gly Leu Ala Asp Asp Ser Phe Ser Cys
35 40 45

Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys Ser Ser Val Val Glu
50 55 60

Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala Gly Pro Cys Ile Pro
65 70 75 80

Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser Glu Ala Tyr Arg
85 90 95

Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn
100 105 110

Gly Ile His Cys Gln His Asn Ile Asn Glu Cys Glu Ala Glu Pro Cys
115 120 125

Arg Asn Gly Gly Ile Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu
130 135 140

Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys Cys Ser Gly
145 150 155 160

His Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln Ile Thr Ala
165 170 175

Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gln Lys Trp Tyr Pro Tyr
180 185 190

Tyr Ala Arg Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp Thr Ala Ala
195 200 205

Glu Asn Asp Arg Trp Pro Trp Ile Gln Ile Asn Leu Gln Arg Lys Met
210 215 220

Arg Val Thr Gly Val Ile Thr Gln Gly Ala Lys Arg Ile Gly Ser Pro
225 230 235 240

Glu Tyr Ile Lys Ser Tyr Lys Ile Ala Tyr Ser Asn Asp Gly Lys Thr
245 250 255

Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Glu Met Val Phe Arg
260 265 270

Gly Asn Val Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro Pro
275 280 285

Ile Lys Ala Gln Tyr Val Arg Leu Tyr Pro Gln Ile Cys Arg Arg His
290 295 300

Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cys Ser
305 310 315 320

Glu Pro Leu Gly Met Lys Ser Gly His Ile Gln Asp Tyr Gln Ile Thr
325 330 335

Ala Ser Ser Val Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp Glu
340 345 350

Pro Arg Lys Ala Arg Leu Asp Lys Gln Gly Lys Val Asn Ala Trp Thr
355 360 365

Ser Gly His Asn Asp Gln Ser Gln Trp Leu Gln Val Asp Leu Leu Val
370 375 380

Pro Thr Lys Val Thr Gly Ile Ile Thr Gln Gly Ala Lys Asp Phe Gly
385 390 395 400

His Val Gln Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asp Gly
405 410 415

Glu His Trp Met Val His Gln Asp Glu Lys Gln Arg Lys Asp Lys Val
420 425 430

Phe Gln Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val Ile Asp
435 440 445

Pro Pro Ile Tyr Ala Arg Phe Ile Arg Ile Leu Pro Trp Ser Trp Tyr
450 455 460

Gly Arg Ile Thr Leu Arg Ser Glu Leu Leu Gly Cys Ala Glu Glu Glu
465 470 475 480



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1779 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TCT CTT TAG TCA CCA CTC TCG CCC TCT CCA AGA ATT TGT TTA ACA AAG
Ser Leu * Ser Pro Leu Ser Pro Ser Pro Arg Ile Cys Leu Thr Lys
1 5 10 15 

CGC TGA GGA AAG AGA ACG TCT TCT TGA ATT CTT TAG TAG GGG CGG AGT 
Arg * Gly Lys Arg Thr Ser Ser * He Leu * * Gly Arg Ser 

20 25 30 

CTG CTG CTG CCC TGC GCT GCC ACC TCG GCT ACA CTG CCC TCC GCG ACG 
Leu Leu Leu Pro Cys Ala Ala Thr Ser Ala Thr Leu Pro Ser Ala Thr 
35 40 45 

ACC CCT GAC CAG CCG GGG TCA CGT CCG GGA GAC GGG ATC ATG AAG CGC 
Thr Pro Asp Gin Pro Gly Ser Arg Pro Gly Asp Gly He Met Lys Arg 
50 55 60 

TCG GTA GCC GTC TGG CTC TTG GTC GGG CTC AGC CTC GGT GTC CCC CAG 
Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly Val Pro Gin 
65 70 75 80 

TTC GGC AAA GGT GAT ATT TGT GAT CCC AAT CCA TGT GAA AAT GGA GGT 
Phe Gly Lys Gly Asp He Cys Asp Pro Asn Pro Cys Glu Asn Gly Gly 

85 90 95 

ATC TGT TTG CCA GGA TTG GCT GTA GGT TCC TTT TCC TGT GAG TGT CCA 
He Cys Leu Pro Gly Leu Ala Val Gly Ser Phe Ser Cys Glu Cys Pro 

100 105 110 

GAT GGC TTC ACA GAC CCC AAC TGT TCT AGT GTT GTG GAG GTT GCA TCA 
Asp Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ala Ser 
115 * 120 125 
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GAT GAA GAA GAA CCA ACT TCA GCA GGT CCC TGC ACT CCT AAT CCA TGC 
Asp Glu Glu Glu Pro Thr Ser Ala Gly Pro Cys Thr Pro Asn Pro Cys 
130 135 140 

CAT AAT GGA GGA ACC TGT GAA ATA AGT GAA GCA TAC CGA GGG GAT ACA
His Asn Gly Gly Thr Cys Glu Ile Ser Glu Ala Tyr Arg Gly Asp Thr
145 150 155 160 

TTC ATA GGC TAT GTT TGT AAA TGT CCC CGA GGA TTT AAT GGG ATT CAC 
Phe lie Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn Gly lie His 

165 170 175 

TGT CAG CAC AAC ATA AAT GAA TGC GAA GTT GAG CCT TGC AAA AAT GGT 
Cys Gin His Asn lie Asn Glu Cys Glu Val Glu Pro Cys Lys Asn Gly 

180 185 190 

GGA ATA TGT ACA GAT CTT GTT GCT AAC TAT TCC TGT GAG TGC CCA GGC 
Gly lie Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu Cys Pro Gly 
195 200 205 

GAA TTT ATG GGA AGA AAT TGT CAA TAC AAA TGC TCA GGC CCA CTG GGA 
Glu Phe Met Gly Arg Asn Cys Gin Tyr Lys Cys Ser Gly Pro Leu Gly 
210 215 220 

ATT GAA GGT GGA ATT ATA TCA AAC CAG CAA ATC ACA GCT TCC TCT ACT 
lie Glu Gly Gly lie lie Ser Asn Gin Gin lie Thr Ala Ser Ser Thr 
225 230 235 240 

CAC CGA GCT CTT TTT GGA CTC CAA AAA TGG TAT CCC TAC TAT GCA CGT 
His Arg Ala Leu Phe Gly Leu Gin Lys Trp Tyr Pro Tyr Tyr Ala Arg 

245 * 250 * ~ 255. 

CTT AAT AAG AAG GGG CTT ATA AAT GCG TGG ACA GCT GCA GAA AAT GAC 
Leu Asn Lys Lys Gly Leu lie Asn Ala Trp Thr Ala Ala Glu Asn Asp 

260 ' 265 ~ 270 

AGA TGG AAG CGG TGG ATT CAG ATA AAT TTG CAA AGA AAA ATG AGA GTT 
Arg Trp Lys Arg Trp lie Gin lie Asn Leu Gin Arg Lys Met Arg Val 
275 ~ 280 " 285 

ACT GGT GTG ATT ACC CAA GGG GCC AAG AGG ATT GGA AGC CCA GAG TAT 
Thr Gly Val lie Thr Gin Gly Ala Lys Arg He Gly Ser Pro Glu Tyr 
290 295 300 

ATA AAA TTC TAC AAA ATT GCC TAC AGT AAT GAT GGA AAG ACT TGG GCA 
lie Lys Phe Tyr Lys He Ala Tyr Ser Asn Asp Gly Lys Thr Trp Ala 
305 310 " 315 320 

ATG TAC AAA GTG AAA GGC ACC AAT GAA GAC ATG GTG TTT CGT GGA AAC 
Met Tyr Lys Val Lys Gly Thr Asn Glu Asp Met Val Phe Arg Gly Asn 

325 330 335 

ATT GAT AAC AAC ACT CCA TAT GCT AAC TCT TTC ACA CCC CCC ATA AAA 
He Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro Pro He Lys 

340 345 350 

GCT CAG TAT GTA AGA CTC TAT CCC CAA GTT TGT CGA AGA CAT TGC ACT 
Ala Gin Tyr Val Arg Leu Tyr Pro Gin Val Cys Arg Arg His Cys Thr 
355 360 365 

TTG CGA ATG GAA CTT CTT GGC TGT GAA CTG TCG GGT TGT TCT GAG CCT 
Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cys Ser Glu Pro 
370 375 380 
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CTG GGT ATG AAA TCA GGA CAT ATA CAA GAC TAT CAG ATC ACT GCC TCC 
Leu Gly Met Lys Ser Gly His lie Gin Asp Tyr Gin lie Thr Ala Ser 
385 390 395 400 

AGC ATC TTC AGA ACG CTC AAC ATG GAC ATG TTC ACT TGG GAA CCA AGG
Ser Ile Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp Glu Pro Arg

405 410 415 

AAA GCT CGG CTG GAC AAG CAA GGC AAA GTG AAT GCC TGG ACC TCT GGC 
Lys Ala Arg Leu Asp Lys Gin Gly Lys Val Asn Ala Trp Thr Ser Gly 

420 425 430 

CAC AAT GAC CAG TCA CAA TGG TTA CAG GTG GAT CTT CTT GTT CCA ACC 
His Asn Asp Gin Ser Gin Trp Leu Gin Val Asp Leu Leu Val Pro Thr 
435 440 445 

AAA GTG ACT GGC ATC ATT ACA CAA GGA GCT AAA GAT TTT GGT CAT GTA 
Lys Val Thr Gly lie lie Thr Gin Gly Ala Lys Asp Phe Gly His Val 
450 455 460 

CAG TTT GTT GGC TCC TAC AAA CTG GCT TAC AGC AAT GAT GGA GAA CAC 
Gin Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asp Gly Glu His 
465 470 " ^ 475 ~ ^ 480 

TGG ACT GTA TAC CAG GAT GAA AAG CAA AGA AAA GAT AAG GTT TTC CAG 
Trp Thr Val Tyr Gin Asp Glu Lys Gin Arg Lys Asp Lys Val Phe Gin 

485 490 495 

GGA AAT TTT GAC AAT GAC ACT CAC AGA AAA AAT GTC ATC GAC CCT CCC 
Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val lie Asp Pro Pro 

500 505 510 

ATC TAT GCA CGA CAC ATA AGA ATC CTT CCT TGG TCC TGG TAC GGG AGG 
lie Tyr Ala Arg His lie Arg lie Leu Pro Trp Ser Trp Tyr Gly Arg 
515 520 525 

ATC ACA TTG GCG TCA GAG CTG CTG GGC TGC ACA GAG GAG GAA TGA GGG 
lie Thr Leu Ala Ser Glu Leu Leu Gly Cys Thr Glu Glu Glu * Gly 
530 535 540 

GAG GCT ACA TTT CAC AAC CGT CTT CCC TAT TTG GGT AAA AGT ATC TCC 
Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser lie Ser 
545 550 ~ 555 * 560 

ATG GAA TGA ACT GTG TAA AAT CTG TAG GAA ACT GAA TGG TTT TTT TTT 
Met Glu * Thr Val * Asn Leu * Glu Thr Glu Trp Phe Phe Phe 

565 570 575 

TTT TCA TGA AAA AGT GGT CAA ATT ATG GTA GGC AAC TAA CGG TGT TTT 
Phe Ser * Lys Ser Gly Gin lie Met Val Gly Asn * Arg Cys Phe 

580 585 590 

TAC C 
Tyr 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Pro Leu Ser Pro Ser Pro Arg Ile Cys Leu Thr Lys Arg
1 5 10 14 

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Gly Lys Arg Thr Ser Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Gly Arg Ser Leu Leu Leu Pro Cys Ala Ala Thr Ser Ala Thr Leu Pro
1 5 10 15

Ser Ala Thr Thr Pro Asp Gln Pro Gly Ser Arg Pro Gly Asp Gly Ile
20 25 30

Met Lys Arg Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly
35 40 45

Val Pro Gln Phe Gly Lys Gly Asp Ile Cys Asp Pro Asn Pro Cys Glu
50 55 60

Asn Gly Gly Ile Cys Leu Pro Gly Leu Ala Val Gly Ser Phe Ser Cys
65 70 75 80

Glu Cys Pro Asp Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu
85 90 95

Val Ala Ser Asp Glu Glu Glu Pro Thr Ser Ala Gly Pro Cys Thr Pro
100 105 110

Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser Glu Ala Tyr Arg
115 120 125

Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn
130 135 140

Gly Ile His Cys Gln His Asn Ile Asn Glu Cys Glu Val Glu Pro Cys
145 150 155 160

Lys Asn Gly Gly Ile Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu
165 170 175

Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys Cys Ser Gly
180 185 190

Pro Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln Ile Thr Ala
195 200 205

Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gln Lys Trp Tyr Pro Tyr
210 215 220

Tyr Ala Arg Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp Thr Ala Ala
225 230 235 240

Glu Asn Asp Arg Trp Lys Arg Trp Ile Gln Ile Asn Leu Gln Arg Lys
245 250 255

Met Arg Val Thr Gly Val Ile Thr Gln Gly Ala Lys Arg Ile Gly Ser
260 265 270

Pro Glu Tyr Ile Lys Phe Tyr Lys Ile Ala Tyr Ser Asn Asp Gly Lys
275 280 285

Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Asp Met Val Phe
290 295 300

Arg Gly Asn Ile Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro
305 310 315 320

Pro Ile Lys Ala Gln Tyr Val Arg Leu Tyr Pro Gln Val Cys Arg Arg
325 330 335

His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cys
340 345 350

Ser Glu Pro Leu Gly Met Lys Ser Gly His Ile Gln Asp Tyr Gln Ile
355 360 365

Thr Ala Ser Ser Ile Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp
370 375 380

Glu Pro Arg Lys Ala Arg Leu Asp Lys Gln Gly Lys Val Asn Ala Trp
385 390 395 400

Thr Ser Gly His Asn Asp Gln Ser Gln Trp Leu Gln Val Asp Leu Leu
405 410 415

Val Pro Thr Lys Val Thr Gly Ile Ile Thr Gln Gly Ala Lys Asp Phe
420 425 430

Gly His Val Gln Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asp
435 440 445

Gly Glu His Trp Thr Val Tyr Gln Asp Glu Lys Gln Arg Lys Asp Lys
450 455 460

Val Phe Gln Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val Ile
465 470 475 480

Asp Pro Pro Ile Tyr Ala Arg His Ile Arg Ile Leu Pro Trp Ser Trp
485 490 495

Tyr Gly Arg Ile Thr Leu Ala Ser Glu Leu Leu Gly Cys Thr Glu Glu
500 505 510

Glu

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser Ile
1 5 10 15 

Ser Met Glu 



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Glu Thr Glu Trp Phe Phe Phe Phe Ser 
1 ~ 5 



(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Lys Ser Gly Gln Ile Met Val Gly Asn
1 5 

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Arg Cys Phe Tyr
1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GACAGATGGC CATGGATTCA GATAAATTTG CAAAGAAAAA TGAGAGTCAC TGGTGTTATT 60 
ACCCAAGGAG CAAAAAGGAT TGGAAGCCCA GAGTACATAA AATCCTACAA AATTGCCTAC 120
AGCAATGACG GGAAGACCTG GGCAATGTAC AAAGTAAAAG GCACCAATGA AGAGATGGTC 180
TTTCGTGGAA ATGTTGATAA CAACACACCA TATGCTAATT CTTTCACACC CCCAATCAAA 240
GCTCAGTATG TAAGACTCTA CCCCCAAATT TGTCGAAGGC ATTGTACTTT AAGAATGGAA 300
CTTCTTGGCT GTGAGCTC 318

(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 320 amino acids

(B) TYPE: amino acid 



(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 

Cys Ser Thr Gln Leu Gly Met Glu Gly Gly Ala Ile Ala Asp Ser Gln
1 5 10 15

Ile Ser Ala Ser Tyr Val Tyr Met Gly Phe Met Gly Leu Gln Arg Trp
20 25 30

Gly Pro Glu Leu Ala Arg Leu Tyr Arg Thr Gly Ile Val Asn Ala Trp
35 40 45

His Ala Ser Asn Tyr Asp Xaa Ser Lys Pro Trp Ile Gln Val Asn Leu
50 55 60

Leu Arg Lys Met Arg Val Ser Gly Val Met Thr Gln Gly Ala Ser Arg
65 70 75 80

Ala Gly Arg Ala Glu Tyr Leu Lys Thr Phe Lys Val Ala Tyr Ser Leu
85 90 95

Asp Gly Xaa Arg Lys Phe Glu Phe Ile Gln Asp Glu Ser Gly Gly Asp
100 105 110

Lys Glu Phe Leu Gly Asn Leu Asp Asn Asn Ser Leu Lys Val Asn Met
115 120 125

Phe Asn Pro Thr Leu Glu Ala Gln Tyr Ile Arg Leu Tyr Pro Val Ser
130 135 140

Cys His Arg Gly Cys Thr Leu Arg Phe Glu Leu Leu Gly Cys Glu Leu
145 150 155 160

His Gly Cys Leu Glu Pro Leu Gly Leu Lys Asn Asn Thr Ile Pro Asp
165 170 175

Ser Gln Met Ser Ala Ser Ser Ser Tyr Lys Thr Trp Asn Leu Arg Ala
180 185 190

Phe Gly Trp Tyr Pro His Leu Gly Arg Leu Asp Asn Gln Gly Lys Ile
195 200 205

Asn Ala Trp Thr Ala Gln Ser Asn Ser Ala Lys Glu Trp Leu Gln Val
210 215 220

Asp Leu Gly Thr Gln Arg Gln Val Thr Gly Ile Ile Thr Gln Gly Ala
225 230 235 240

Arg Asp Phe Gly His Ile Gln Tyr Val Glu Ser Tyr Lys Val Ala His
245 250 255

Ser Asp Asp Gly Val Gln Trp Thr Val Tyr Xaa Xaa Glu Glu Gln Gly
260 265 270

Ser Ser Lys Val Phe Gln Gly Asn Leu Asp Asn Asn Ser His Lys Lys
275 280 285

Asn Ile Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arg Val Leu Pro
290 295 300

Val Ser Trp His Asn Arg Ile Thr Leu Arg Leu Glu Leu Leu Gly Cys
305 310 315 320



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Cys Ser Gly Pro Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln
1 5 10 15

Ile Thr Ala Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gln Lys Trp
20 25 30

Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp
35 40 45

Thr Ala Ala Glu Asn Asp Arg Trp Asn Arg Trp Ile Gln Ile Asn Leu
50 55 60

Gln Arg Lys Met Arg Val Thr Gly Val Ile Thr Gln Gly Ala Lys Arg
65 70 75 80

Ile Gly Ser Pro Glu Tyr Ile Lys Phe Tyr Lys Ile Ala Tyr Ser Asn
85 90 95

Asp Gly Lys Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Asp
100 105 110

Met Val Phe Arg Gly Asn Ile Asp Asn Asn Thr Pro Tyr Ala Asn Ser
115 120 125

Phe Thr Pro Pro Ile Lys Ala Gln Tyr Val Arg Leu Tyr Pro Gln Val
130 135 140

Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu
145 150 155 160

Ser Gly Cys Ser Glu Pro Leu Gly Met Lys Ser Gly His Ile Gln Asp
165 170 175

Tyr Gln Ile Thr Ala Ser Ser Ile Phe Arg Thr Leu Asn Met Asp Met
180 185 190

Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Asp Lys Gln Gly Lys Val
195 200 205

Asn Ala Trp Thr Ser Gly His Asn Asp Gln Ser Gln Trp Leu Gln Val
210 215 220

Xaa Leu Leu Val Pro Thr Lys Val Thr Gly Ile Ile Thr Gln Gly Ala
225 230 235 240

Lys Asp Xaa Gly His Val Gln Phe Val Gly Ser Tyr Lys Leu Ala Tyr
245 250 255

Ser Asn Asp Gly Glu His Trp Thr Val Xaa Gln Asp Glu Lys Gln Arg
260 265 270

Lys Asp Lys Val Xaa Gln Gly Asn Phe Asp Asn Asp Thr His Arg Lys
275 280 285

Asn Val Ile Asp Pro Pro Ile Tyr Ala Arg His Ile Arg Ile Leu Pro
290 295 300

Trp Ser Trp Tyr Gly Arg Ile Thr Leu Ala Ser Glu Leu Leu Gly Cys
305 310 315 320

Thr



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Lys Arg Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly
1 5 10 15

Val Pro Gln Phe Gly Lys Gly Asp Ile
20 25

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: unknown






(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Cys Asp Pro Asn Pro Cys Glu Asn Gly Gly Ile Cys Leu Pro Gly Leu
1 5 10 15

Ala Val Gly Xaa Xaa Xaa Xaa Xaa Ser Phe Ser Cys Glu Cys Pro Asp
20 25 30

Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ala Ser Asp
35 40 45

Glu Glu Glu Pro Thr Ser Ala Gly Pro
50 55

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Cys Thr Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser Glu
1 5 10 15

Ala Tyr Arg Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg
20 25 30

Gly Phe Asn Gly Ile His Cys Gln His Asn Ile
35 40

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Cys Glu Val Glu Pro Cys Lys Asn Gly Gly Ile Cys Thr Asp Leu Val
1 5 10 15

Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Tyr Ser Cys Glu Cys Pro Gly
20 25 30

Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys
35 40

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Cys Xaa Xaa Xaa Pro Cys Xaa Asn Gly Gly Xaa Cys Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Cys Xaa Cys Xaa Xaa 

20 25 30 

Gly Tyr Xaa Gly Xaa Xaa Cys Xaa 
35 40 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..309

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
NGTGATATTT GTGATCCCAA TCCATGTGAA AATGGAGGTA TCTGTTTGCC AGGATTGGCT 
GTAGGTTCCT TTTCCTGTGA GTGTCCAGAT GGCTTCACAG ACCCCAACTG TTCTAGTGTT 
GTGGAGGTTG GTCCCTGCAC TCCTAATCCA TGCCATAATG GAGGAACCTG TGAAATAAGT 
GAAGCATACC GAGGGGATAC ATTCATAGGC TATGTTTGTA AATGTCCCCG AGGATTTAAT 
GGGATTCACT GTCAGCACAA CATAAATGAA TGCGAAGTTG AGCCTTGCAA AAATGGTGGA 
ATATGTACAG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 550...1212

(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:




GAATTCCGGG 


AGGGAGGGTA 


GGGGGGCGGG 


CCGCGGGGGC 


CCAAAGCCAG 


CTAGGCTCAG 


TCTCACACGC 


GCGCCGCCAC 


TGTTTGTATA 


TAGTGCGCTC 


CTGGCCTCAG 


GCTCGCTCCC 


CTCCAGCTCT 


CGCTTCATTG 


TTCTCCAAGT 


CAGAAGCCCC 


CGCATCCGCC 


GCGCAGCAGC 


GTGAGCCGTA 


GTCACTGCTG 


GCCGCTTCGC 


CTGCGTGCGC 


GCACGGAAAT 


CGGGGAGCCA 


GGAACCCAAG


GAGCCGCCGT 


CCGCCCGCTG 


TGCCTCTGCT 


AGACCACTCG 


CAGCCCCAGC 


CTCTCTCAAG 


CGCACCCACC 


TCCGCGCACC 


CCAGCTCAGG 


CGAAGCTGGA 


GTGAGGGTGA 


ATCACCCTTT 


CTCTAGGGCC 


ACCACTCTTT 


TATCGCCCTT 


CCCAAGATTT 


GAGAAGCGCT 


GCGGGAGGAA 


AGACGTCCTC 


TTGATCTCTG 


ACAGGGCGGG 


GTTTACTGCT 


GTCCTGCAGG 


CGCGCCTCGC 


CTACTGTGCC 


CTCCGCTACG 


ACCCCGGACC 


AGCCCAGGTC 


ACGTCCGTGA 



GAAGGGATC ATG AAG CAC TTG GTA GCA GCC TGG CTT TTG GTT GGA CTC AGC 591
Met Lys His Leu Val Ala Ala Trp Leu Leu Val Gly Leu Ser
1 5 10

CTC GGG GTG CCC CAG TTC GGC AAA GGT GAC ATT TGC AAC CCG AAC CCC 639
Leu Gly Val Pro Gln Phe Gly Lys Gly Asp Ile Cys Asn Pro Asn Pro
15 20 25 30

TGT GAA AAT GGT GGC ATC TGT CTG TCA GGA CTG GCT GAT GAT TCC TTT 687
Cys Glu Asn Gly Gly Ile Cys Leu Ser Gly Leu Ala Asp Asp Ser Phe
35 40 45

TCC TGT GAG TGT CCA GAA GGC TTC GCA GGT CCG AAC TGC TCT AGT GTT 735
Ser Cys Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys Ser Ser Val
50 55 60

GTG GAG GTT GCA TCA GAT GAA GAA AAG CCT ACT TCA GCA GGT CCC TGC 783
Val Glu Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala Gly Pro Cys
65 70 75

ATC CCT AAC CCA TGC CAT AAC GGA GGA ACC TGT GAG ATA AGC GAA GCC 831
Ile Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser Glu Ala
80 85 90

TAT CGA GGA GAC ACA TTC ATA GGC TAT GTT TGT AAA TGT CCT CGG GGA 879
Tyr Arg Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg Gly
95 100 105 110

TTT AAT GGG ATT CAC TGT CAG CAC AAT ATA AAT GAA TGT GAA GCT GAG 927
Phe Asn Gly Ile His Cys Gln His Asn Ile Asn Glu Cys Glu Ala Glu
115 120 125

CCT TGC AGA AAT GGC GGA ATA TGT ACC GAC CTT GTT GCT AAC TAC TCT 975
Pro Cys Arg Asn Gly Gly Ile Cys Thr Asp Leu Val Ala Asn Tyr Ser
130 135 140

TGT GAA TGC CCA GGA GAA TTT ATG GGA CGA AAT TGT CAA TAT AAA TGC 1023
Cys Glu Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys Cys
145 150 155

TCT GGG CAC TTG GGA ATC GAA GGT GGG ATC ATA TCT AAT CAG CAA ATC 1071
Ser Gly His Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln Ile
160 165 170

ACA GCT TCA TCT AAT CAC CGA GCT CTT TTT GGA CTC CAG AAG TGG TAT 1119
Thr Ala Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gln Lys Trp Tyr
175 180 185 190

CCC TAC TAT GCT AGA CTT AAT AAG AAG GGC CTT ATA AAT GCC TGG ACA 1167
Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp Thr
195 200 205

GCT GCT GAA AAT GAC AGA TGG CCA TGG ATT CAG GTA ACA GTG GGA TGAGA 1217
Ala Ala Glu Asn Asp Arg Trp Pro Trp Ile Gln Val Thr Val Gly
210 215 220

CAAATCCATT TCCCAAATTA TCAGAATCAT TATAGAAGTA GGTTAGGGAG AATTGGCTGT 1277

GATTCTTTCT CATGGTTAAA ATGTGATTTA GTTCAGAATT AACATGGTTG GAAACTCTAA 1337

AAAATGTGGA AAACAGGAAC ATTCTATGTC TGAAAATCTG AAAATAGCAT CAAGATGAAA 1397

ACATTCTTTA GTCATAAATA TACTCTTTTA AGTTATAGTA GAGAAAAAGA TCTTATCATT 1457

TCATAAGTGG ACTTTTGGGA TAGCATTGGA AATGTAAATG AAATAAATAC CTAATTGAAA 1517

AAAGTTTATT CTAAAGTGTT AATATTTAGC AACAGATTCA GAGACAAGAA AGTAACAATT 1577

CAATCTGTGT ATTTTTTGTG AGAAATAGTT TCCCATGTGC AAATATAAAG TGCGCATCAT 1637

ATCATGATAA TATCCAACTG TCTGCAGAAC TCCCTTTCAT AAATGAGAGA ATTTTAATTC 1697

ATAGTGCCTT ATATCCTCAT CAGCCATCTG ACTTTACTAC AGAAGAAAAC AATGAAATGA 1757

TGCATTAAGT GCTTTGCTAG AAGAAACATC ATAGCAAAGC TGATAGCCCA CATTCTGTGC 1817

ANNNAAGCTT CCAGAGCACT CGAGAAAAAG CAGAAATGAG ATGTTTTATG AAAACCGAAA 1877

AGATAATCTG ATTTCTGTGA AATATACTTT TGATCATGTG GTTCTTTAAG ATAGTCACTA 1937

ACAAGTCATT AGTAGCAGAT ACCAAATGGG AGAAAATTTC CAGTATACTG AGGGTCAAGG 1997

CAGTCATGCT GAAACTACAT GAGGTCAGGA AAGTTTTGAA ATAAGGTGAT TTTGGAAGGA 2057

TACCTTCAAC TGGCCTAGAT TTTCAAGAAA CAGTGTAATC AACAGCCAAA CATGAGAATC 2117

TAGCTAACAG CATTTAGAAA ACCAGAACTA AGAGTGTTAC TGGGGAATTG CATTTAAATC 2177

CAGTATGAGA GTTTGCAAAT GCCGTATTCT TCTAAGGGGT TTGTGCCACA TTTTGTTACC 2237

ATGGAGTCCT CTGTAAGAAC TTTATTAGAT AAATCATCTT TACACTATAA TTTGAATAAA 2297

AGCCGGAATT C 2308



(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 221 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Met Lys His Leu Val Ala Ala Trp Leu Leu Val Gly Leu Ser Leu Gly
1 5 10 15

Val Pro Gln Phe Gly Lys Gly Asp Ile Cys Asn Pro Asn Pro Cys Glu
20 25 30

Asn Gly Gly Ile Cys Leu Ser Gly Leu Ala Asp Asp Ser Phe Ser Cys
35 40 45

Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys Ser Ser Val Val Glu
50 55 60

Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala Gly Pro Cys Ile Pro
65 70 75 80

Asn Pro Cys His Asn Gly Gly Thr Cys Glu Ile Ser Glu Ala Tyr Arg
85 90 95

Gly Asp Thr Phe Ile Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn
100 105 110

Gly Ile His Cys Gln His Asn Ile Asn Glu Cys Glu Ala Glu Pro Cys
115 120 125

Arg Asn Gly Gly Ile Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu
130 135 140

Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gln Tyr Lys Cys Ser Gly
145 150 155 160

His Leu Gly Ile Glu Gly Gly Ile Ile Ser Asn Gln Gln Ile Thr Ala
165 170 175

Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gln Lys Trp Tyr Pro Tyr
180 185 190

Tyr Ala Ala Leu Asn Lys Lys Gly Leu Ile Asn Ala Trp Thr Ala Ala
195 200 205

Glu Asn Asp Arg Trp Pro Trp Ile Gln Val Thr Val Gly
210 215 220






